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Abstract 

We survey a new paradigm in signal processing known as "compressive sensing". 
Contrary to old practices of data acquisition and reconstruction based on the Shannon- 
Nyquist sampling principle, the new theory shows that it is possible to reconstruct 
images or signals of scientific interest accurately and even exactly from a number of 
samples which is far smaller than the desired resolution of the image/ signal, e.g., the 
number of pixels in the image. This new technique draws from results in several fields 
of mathematics, including algebra, optimization, probability theory, and harmonic 
analysis. We will discuss some of the key mathematical ideas behind compressive 
sensing, as well as its implications to other fields: numerical analysis, information 
theory, theoretical computer science, and engineering. 

1 Introduction 

Compressive sensing ||45llll9l is a new concept in signal processing where one seeks to 
minimize the number of measurements to be taken from signals while still retaining 
the information necessary to approximate them well. The ideas have their origins 
in certain abstract results from functional analysis and approximation theory f79[ l92t 
but were recently brought into the forefront by the work of Candes, Romberg and 
Tao IfTSl [l5l [T2| and Donoho 14511 who constructed concrete algorithms and showed 
their promise in application. 

Sparse approximation has been studied for nearly a century, and it has numerous 
applications. Temlyakov [111] locates the first example in a 1907 paper of Schmidt 
II104L In the 1950s, statisticians launched an extensive investigation of another sparse 
approximation problem called subset selection in regression |[87l and recently least 
angle regression ||54illl3L Later, approximation theorists began a systematic study 
of wj-term approximation with respect to orthonormal bases and redundant systems 
11381 ITTTH and very recently in l25ll^ . 

Over the last decade, the signal processing community spurred by the work of 
Coifman et al. ||28l|29| and Mallat et al. HHlZllMl has become interested in sparse 
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representations for compression and analysis of audio Il72]| , images Il63l and video 
1901 . Sparsity criteria also arise in deconvolution [1101, signal modeling fTOO], pre- 
conditioning [74J, machine learning |70|, de-noising [22], regularization |33, 35| and 
error correction [16, 19, 60, 58, 59, 61 J. Most sparse approximation problems employ a 
linear model in which the collection of elementary signals is both linearly dependent 
and large. These models are often called redundant or overcomplete. Recent research 
suggests that overcomplete models offer a genuine increase in approximation power 
[95 1 .621. Unfortunately, they also raise a serious challenge. How do we find a good 
representation of the input signal among the plethora of possibilities? One method 
is to select a parsimonious or sparse representation. The exact rationale for invok- 
ing sparsity may range from engineering to economics to philosophy. At least three 
justifications are commonly given: 

1. It is sometimes known a priori that the input signal can be expressed as a short 
linear combination of elementary signals also contaminated with noise. 

2. The approximation may have an associated cost that must be controlled. For 
example, the computational cost of evaluating the approximation depends on 
the number of elementary signals that participate. In compression, the goal is to 
minimize the number of bits required to store the approximation. 

3. Some researchers cite Occam's Razor, "Pluralitas non est ponenda sine necessi- 
tate" (causes must not be multiplied beyond necessity). 

Sparse approximation problems are computationally challenging because most 
reasonable sparsity measures are not convex. A formal hardness proof for one im- 
portant class of problems independently appeared in [88 J and [36 J. A vast array of 
heuristic methods for producing sparse approximations have been proposed, but the 
literature contains few guarantees of their performance. The pertinent numerical tech- 
niques fall into at least three basic categories: 

1. The convex relaxation approach replaces the nonconvex sparsity measure with 
a related convex function to obtain a convex programming problem. The convex 
program can be solved in polynomial time with standard software [8], and one 
expects that it will yield a good sparse approximation. More on that will be said 
in the sequel. 

2. Greedy methods make a sequence of locally optimal choices in an effort to pro- 
duce a good global solution to the approximation problem. This category in- 
cludes forward selection procedures (such as matching pursuits), backward se- 
lection and others. Although these approaches sometimes succeed ||3T1 167[ [69l 
im [nHUllZl [1181, they can also fail spectacularly ||40l|22l. The monographs of 
Miller | |87| and Temlyakov [111] taste the many flavors of greedy heuristic. 

3. Specialized nonlinear programming software has been developed that attempts 
to solve sparse approximation problems directly using, for example, interior 
point methods [96 1. These techniques are only guaranteed to discover a locally 
optimal solution though. 

Several problems require solutions to be obtained from underdetermined systems of 
linear equations, i.e., systems with fewer equations than unknowns. Some example 
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Figure 1: When Fourier coefficients of a testbed medical image known as the 
Logan-Shepp phantom (top left) are sampled along 22 radial lines in the fre- 
quency domain (top right), a naive, "minimal energy" reconstruction setting 
unobserved Fourier coefficients to is marred by artifacts (bottom left), ii- 
reconstruction (bottom right) is exact. 

of such problems arise in linear filtering signal processing, and inverse problems. For 
an underdetermined system of linear equations, if there is any solution, there are in- 
finitely many solutions. In many applications, the "simplest" solution is most accept- 
able. Such a solution is inspired by the minimalist principle of Occam's Razor. For 
example, if the parameters of a model are being estimated then among all models that 
explain the data equally well, the one with the minimum number of parameters is 
most desirable. 

The notion that sparse signals-meaning signals with a small number of nonzero 
coefficients for a given basis (and no noise) one can (with high probability) be recon- 
structed exactly via -minimization is not exactly new. The idea was first expressed 
in 1986, by Fadil Santosa and William Symes [103J. But the full extent of the theory, 
including the robustness of the reconstruction procedure, is only now coming into full 
focus. One of the champions of this approach, David Donoho, coined the term "com- 
pressed sensing" to emphasize the fact that £i -minimization is not just a new way of 
massaging a "complete" set of measurements into a compact form, but rather a new 
way of thinking about how to measure things in the first place |45|. 

This new way of thinking has profoundly practical implications. Making mea- 
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surements can be expensive, in terms of time, money, or (in the case of, say, x-rays) 
damage done to the object being imaged. Compressive sensing has the potential to 
provide substantial cost savings without sacrificing accuracy. In one impressive nu- 
merical experiment, Candes, Romberg, and Tao | |T2| showed that a 512 x 512-pixel 
test image, known as the Logan-Shepp phantom, can be reconstructed exactly from 
512 Fourier coefficients sampled along 22 radial lines-with, in other words, more than 
95% of the ostensibly relevant data missing (see Figure 1). 

A host of practical applications are now being explored, including new sensing 
techniques, new analog-to-digital converters, and a new digital camera with a single 
photon detector, being developed by Kevin Kelly, Richard Baraniuk and the Digital 
Signal Processing group at Rice (dsp.rice.edu/ cs/ cscamera) Iil07ul21)| . 

2 Mathematical foundations 

2.1 Sparsity and undersampling 

The celebrated Nyquist-Shannon-Whittaker sampling theorem shows that a signal 
with bandwidth 2Q is completely determined by its uniform samples if and only if the 
samples are taken at least at the Nyquist rate Q/tt. This principle used to underlie all 
signal acquisition techniques used in practice, such as consumer electronics, medical 
imaging, analog-to-digital conversion and so on. Compressive sampling puts forward 
a novel sampling paradigm that replaces the notion of band-limited signals with that 
of sparse signals. This new notion allows for dramatically "undersampled" signals to 
be captured and manipulated using a very small amount of data. The point of this 
section is to explain the basic mathematics behind this new theory. 

Suppose X is an unknown vector in (a digital image or signal). We plan to 
sample x using n linear functionals of x and then reconstruct. We are interested in 
the case n <^ N, when we have many fewer measurements than the dimension of the 
signal space. Such situations arise in many applications. For example, in biomedical 
imaging, far fewer measurements are typically collected than the number of pixels 
in the image of interest. Further examples are provided by virtually any domain of 
science or technology where amounts of data are very large and costs of observa- 
tion/ acquisition/ measurement are nontrivial. 

The measurements y/^ are obtained by sensing x against n vectors (pj^ G R^. Thus 
yjt = (x, (pj^) for = 1, . . . , n, or, equivalently 

y = Ox (1) 

for some n x N measurement/sensing matrix O. Thus we arrive at an underdetermined 
system of linear equations, which, as is well known, in general has infinitely many 
solutions, so our problem is ill-posed. But suppose that our signal x is sparse or com- 
pressible, i.e., that is (essentially) depends only on a small number of degrees of free- 
dom. To give a first impression of the theory, we in fact assume that the signal can be 
written exactly as a linear combination of only a few basis vectors. 

Mathematically the problem can be formulated as follows. Given a matrix O S 
j^nxN many more columns than rows (n <C N), and a vector y G R", find a 
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Figure 2: ii minimization 



vector X G K, with a minimum possible number of nonzero entries, i.e., 

minimize II X II subject to <i>x = y (2) 

where ||x||o is the number of nonzero entries of x Il30|l . By allowing noise (e > 0), we 
obtain a variation of the problem 

minimize II X II subject to ||^x — i/||2 < £, (3) 

These problems per se are NP-hard even for e = 0, see ||65l[88ll . 

The classical, well studied, approach would be to minimize the 2-norm ||x||2 in 
the above problems, but this usually yields a solution vector x that is full, while for a 
sparse representation we would like to find a vector x with few nonzero entries. 

The main approach taken in compressive sensing is to minimize the 1-norm ||x||i 
instead. 

minimize ||x||i subject to Ox = y, (4) 

and 

minimize II X 111 subject to ||Ox — i/||2 < e, (5) 

respectively ||22| , where ||x||i := E;|^/| (See Figure 2). Very surprisingly, the £i min- 
imization yields the same result as the io minimization in many cases of practical 
interest. This phenomenon was initially observed by engineers and geophysicists, 
most notably Claerbout and Logan as early as 1970s (see |110|), and by Santosa and 
Symes in 1986 |103|, as mentioned in the introduction. In the last five years or so, a 
series of papers [l^SllllSlllEllZllHEIMll^llQSJ explained why £i minimization can 
recover sparse signals in a variety of practical setups. In our next section, we give a 
few sample theorems about this remarkable phenomenon. 
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Finally, the £i minimization problem can be efficiently solved by convex program- 
ming or by linear programming (LP) [81. Most compressive sensing results due to 
Candes, Donoho, Romberg, Tao and others [411 SH |45l |25l [151 are all based on 
this method (see also l,125t 11261 1127| ). Other approaches include greedy algorithms, 
for instance, the so-called matching pursuit introduced by Mallat and Zhang |j84[ |9Tl 
111411115 1. Recently many variations on matching pursuit have been proposed, among 
which are orthogonal matching pursuit 19111891 , stagewise orthogonal matching pur- 
suit ||51J , gradient pursuit [6J, and others. 

2.2 Incoherence and restricted isometry 

Given an n x N matrix O, the first basic question is to determine whether O is good 
for compressive sensing, i.e., will lead to good recovery of sparse solutions to the 
equations = y. 

Candes and Tao fT0l-|j20l introduced a necessary condition that guarantees an es- 
timate of its performance on classes of sparse vectors. 

Definition ( ||l9l[l2[[l3ll ). A matrix O is said to satisfy the Restricted Isometry Property 
(RIP) of order k with constant 5 := 5^ e if 

(1-4)11^111 < ll^^lli < + (6) 

for any x such that \\x\\q < k. 

It is straightforward to see that this condition can be reformulated as follows: Con- 
sider n X #T matrices <I>r formed by the columns of O with indices in the set T. Then 
the Gramian matrices 

Gt := O'^Or 

are bounded and boundedly invertible on I2 with bounds as in ((6]|, uniform for all 
T of size #T = k. Since each matrix Gj is symmetric and normegative definite, this 
is equivalent to each of these matrices having their eigenvalues in the interval [1 — 
Sk, 1 + S]^]. The role played by RIP becomes clear from the following result of Candes 
and Tao. 

Theorem 1 ( |[T8l[Tl1l ). If the n x N matrix satisfies RIP of order 3k for some 5 S (0, 1), then, 
for any vector x S R^, the ii minimization problem @ has a solution x* such that 

\\x-x II2 < C - " ^ / (7) 

where Xy. denotes the best k-sparse approximation to x and C denotes a constant. 

The condition ((Tj) means that the O's with higher values of k for which RIP is satis- 
fied perform better in compressive sensing. For example, if an n x N matrix O has the 
restricted isometry property of order k, then its performance in /2-norm on the unit 
ball of l^ is of order C/ y/k, and the optimal performance is achieved if O satisfies RIP 
of order k = 0(n/ log(N/ n)) 13911 . This is indeed achieved via various probabilistic 
constructions l[?Tll42ll4Sll44ll4g||. 
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A primary matrix measure related to RIP is mutual incoherence (22llll8iril4llll5l| : 

:=max|(0^0),-,|, 

i.e., the maximum inner product of distinct columns of O. Since the columns of O are 
usually normalized to be of 2-norm 1, the mutual incoherence of a matrix is between 
and 1. 

This notion can be generalized f77\ as follows: For a given normalized matrix 
O G R"^'^, its k-mutual incoherence M.k{<i>) is defined by 

Mki^) :=maxmax |(<I>cOs);;|. 

#S<k i^i ' 

The mutual incoherences M.^ are intimately related to the best constant 5]^ with which 
the matrix O satisfies RIP of order k, but a full understanding of this connection has 
not been reached I77l[78ll . 

A challenging aspect of RIP is its computational cost. Indeed, RIP is a property 
of the submatrices of a specific size. At present, no subexponential-time algorithm 
is known for testing RIP. Introducing other matrix measures may potentially help in 
effectively verifying the RIP or finding other, less demanding conditions for sparse 
recovery. 

One such weaker condition has been introduced by Cohen, Dahmen and DeVore 
in |l25ll . To motivate their condition, we first recall that a pair (O, A) where O is a 
sensing matrix and A is a decoder, is called instance-optimal of order k for a normed 
space (y, II ■ if there exists an absolute constant C such that 

||x - A(Ox)||y < C\\x - x^Wv- 

The matrix O has the null space property in V if 

||^||y < c||x — X|c||y for all x such that Ox = 0. 

The importance of the null space property can be seen from the following result: 

Theorem 2 ( ||25ll26]| ). Given ann x N matrix O, a norm \\ ■ \\v dnd a value k, the instance 
optimality in V with constant Cq is equivalent to the null space property of(t> of order 2k with 
the constant Co/2 in the sufficiency part and the same constant Co in the necessity part. 

Note that the null space property is preserved under row operations on the matrix 
<I> since, as its name suggests, it is simply a property of its null space. This property is 
therefore less rigid than the RIP and may allow for a more efficient verification. 



2.3 Compressible signals 

In practice, most signals may not be exactly sparse in a given basis but may concen- 
trate near a sparse set. In fact, the most commonly used models in signal processing 
assume that the coefficients of the signal with respect to, say, a wavelet basis, decay 
rapidly away from their essential support. Smooth signals, images with bounded 
variation and those with bounded Besov norm are known to be of that type. 
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Given a nearly sparse signal x, denote by Xj^ its best /c-sparse approximation, i.e., 
the vector obtained by keeping the k largest coefficients of x and discarding the rest. 
Candes, Romberg and Tao [12J showed that the initial signal can be recovered with 
error of order 1 1 x — 1 1 1 / whenever the sensing matrix satisfies RIP of order 4/c and 
the RIP constants S^k and 64]^ are not too close to 1. 

Theorem 3 (UHl). Let <I> satisfy RIP of order with d^k + ^S^k < 2. Then, for any signal 
X, the solution x* to @ satisfies 

||,.-,||,<C^t^, 

with a well-behaved constant C. 

A similar result holds l|T2| for stable recovery from imperfect measurements, i.e., in the 
setting of problem (jS)). All together, this indicates that £1 minimization stably recovers 
the largest k coeffients of a nearly /c-sparse vector even in the presence of noise. 

This result is in fact optimal for important classes of signals: Let x belong to 
the weak-£p ball or radius R, i.e., let the decreasing rearrangement of its coefficients 
1^1(1) > 1^1(2) ^ ■ ■ ■ ^ 1^1 (N) satisfy the condition 

< R-ri/p, / = 1,...,N. 

This can be shown to imply 

\\x - XkWi <CR- k^l^-^lv and ||x - x^||i < C ■ R ■ k^-^'v 

for some constant C. Moreover, for generic elements in weak-£p, no better estimates 
are obtainable. In other words, i\ recovery achieves an approximation error roughly 
as small as the error obtained by deliberately selecting the k largest coefficients of the 
signal. 

2.4 Good sensing matrices 

Most sampling algorithms developed so far in compressive sensing are based on ran- 
domization [17, 45|. Typically, the sensing matrices are produced by taking i.i.d. ran- 
dom variables with some given probability distribution and then normalizing their 
columns. Such matrices are guaranteed to perform well with very high probability, 
i.e., with the failure rate exponentially small in the size of the matrix |45|. Following 
QUI, we mention three random constructions that are by now standard. 

Random matrices with i.i.d. entries. Consider the matrix O with entries drawn 
independently at random from the Gaussian probability distribution with mean 
zero and variance 1/n. Then ITSl |42| . with overwhelming probability, the l\ 
minimization (|4]| recovers fc-sparse solutions whenever 

k < const ■ n/ log(N/n). 
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Fourier ensemble. Let O be obtained by randomly selecting n rows from the N x N 
discrete Fourier transform and renormalizing the columns so that they have 2- 
norm 1. If the rows are selected at random, then |15| as above, with overwhelm- 
ing probability, the £i minimization (HJ recovers /c-sparse vectors for 

k < const - n/ (log N)^. 

General orthogonal ensembles. Suppose O is obtained by selecting n rows from 
an N X N orthonormal matrix U and renormalizing the columns to be of unit 
length. In the rows are selected at random, then |15| /c-sparse recovery by ii 
minimization ^ is guaranteed with overwhelming probability provided that 

1 n 

k < const ■ . ^, -jz — rr-. 

M^{U) (logN)'^ 

Note that the Fourier matrix U satisfies Ai{U) = 1, so this is a generalization of 
the Fourier ensemble. 

The natural problem already being addressed by several authors is how to achieve 
robust deterministic constructions of good CS matrices. Tao in |109| points out the im- 
portance of this problem, as well as its similarity to other derandomization problems 
from theoretical computer science and combinatorics. Several deterministic construc- 
tions are currently known (see, e.g., ||39H76|). However, the performance of matrices 
provided by these deterministic constructions is not yet on a par with that of matrices 
arising probabilistically. 

To give several examples, DeVore in ||39| proposes a construction of cyclic matri- 
ces using finite fields that satisfy RIP of order k for k < C^/n log n / log (N/n), which 
falls short of the above-mentioned range k < Cn/\og{N/n) known for probabilis- 
tic constructions. Indyk in |76| and Xu and Hassibi in 11123 II propose another scheme 
for compressive sensing with deterministic performance guarantees based on bipar- 
tite expander graphs. Another flavor of randomness is introduced in f2] where ran- 
dom Toeplitz matrices are constructed with entries drawn independently from a given 
probability distribution. 

2.5 Optimality and w-widths 

The performance of the best sensing matrices ^, which is presently achieved by ran- 
dom matrices with probabilistic guarantees, yields recovery of fc-sparse vectors using 
n samples (so that the matrix O is n x N) provided that 

k < const • n/ log(N/n). 

In particular, a fc-sparse vector can be recovered, say, by random projections, of di- 
mension 0{k ■ log(N/fc)) [41J. 

For signals x in the weak-£p ball of radius R, £i recovery gives the error ||45]| 

\\x* -x\\2 < const- R ■ (n/log(N/n))"^/P+^/2. 
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It turns out that this performances cannot be improved even by using possibly adap- 
tive sets of measurements and reconstruction algorithms. 

The matter turns out to be closely related to the issue of the so-called Gelfand 
widths [92J known from approximation theory: For a class T , let (^) be the best 
reconstruction error from n linear measurements 

£„(^):=infsup||/-D(y)||2, y = ^f, 

where the infimum is over all sets of n linear functionals and all reconstruction algo- 
rithms D. The error E„{J^) is essentially equal |92| to the Gelfand width of the class 
T defined as 

dn{T) :=inf{sup \\Pvf\\ : codim(y) < n}, 

where Py is the orthogonal projector on the subspace V. Gelfand widths are known for 
many classes of interest. In particular, Kashin [79J, Garnaev and Gluskin |[66| showed 
that the Gelfand widths for the weak-^p ball of radius R satisfy 



for some universal constants c and C. 

This shows that the recovery provided by compressive sensing techniques is in fact 
optimal for weak-ip norms in spite of being completely non-adaptive 1251 [T0]| . This is 
one more indication of the great potential of compressive sensing in applications. 



3 Connections with other fields 
3.1 Statistical estimation 

Candes fTO\ and Donoho f45^ point out a number of connections of compressive sens- 
ing with ideas from statistics and coding theory. We briefly mention main ideas here. 

In statistical estimation, the signal is assumed to be measured with stochastic er- 
rors 

y = <i>x + z 

where z is a vector of i.i.d. (independent identically distributed) random variables 
with mean zero and variance cr^. Very often, z is assumed to be Gaussian. The problem 
is again to recover x from y. 

One seeks to design an estimator whose accuracy depends on the information con- 
tent of the object x. The Dantzig selector \20\ estimates x by solving the convex program 

minimize II f 111 subject to sup | (0"^r),| < A(7 

i 

for some A > 0, where r is the residual r:=y — Ox. These ideas are very close to the 
so-called lasso approach |113ll54| . 
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Analogously to ii minimization in compressive sensing, the Danzig selector was 
shown f20^| to recover sparse and compressible signals with the number of measure- 
ments much smaller than the dimension of x and within a logarithmic factor of the 
ideal mean squared error one would only achieve with an oracle supplying perfect 
information which coordinates are nonzero and which are above the noise level. 

3.2 Error-correcting codes 

In coding theory ||60ll58ll59ll6T1l , a vector x is transmitted to a remote receiver. The in- 
formation X is encoded using an n x N matrix C with n <C N. Gross errors may occur 
during transmission, so that a fraction of the entries of Cx is completely corrupted. 
The location and the damage done to those entries are unknown. It turns out that 
a constant fraction of errors with arbitrary magnitudes can still be corrected fl9\ by 
solving a suitable linear minimization problem. In fact, known methods recover the 
vector X exactly provided the fraction of the corrupted entries is not too big |l20l[l0| . 

3.3 Frame theory 

The theory of compressive sensing matrices closely resembles the basic theory of 
frames |l23l l32l l83l I124j . A countable collection of elements {fi}iei is a frame for a 
Hilbert space H if there exist constants < A < B < oo (the lower and upper frame 
bound) such that, for all g E H, 

MgfH<L\{sJM'<B\\gfH. 

A frame is called tight if the upper and lower bounds are the same A = B. A frame 
is bounded if inf/gj ||/;||h > (the condition sup^^j WfiWn < ^ follows automatically 
from the definition of a frame). A hame is unit norm if = Iforall/ G I. If 

is a frame only for its closed linear span, it is called a frame sequence. A family {fi}i<=i 
is a Riesz basic sequence for H if it is a Riesz basis for its closed linear span, i.e., if, for 
some constants < A < B < oo and for all sequences of scalars {c,},ej, 

iGl iGl ii=I 

The analogy with the restricted isometry property is obvious, however, the latter is 
imposed only on submatrices formed from the original matrix. 

This analogy must be worth pursuing in both directions, i.e., looking for applica- 
tions of the theory and methodology of compressive sensing to frames and vice versa. 
Randomization techniques from compressive sensing could be of particular interest 
in attacking problems from frame theory (cf. ||4l lll6| ). 

4 Practical implications 

Compressive sensing, and more generally the possibility of efficiently capturing sparse 
and compressible signals using a relatively small number of measurements, paves the 
way for a number of possible applications. 
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Figure 3: The scheme of the CS camera 

Data acquisition. New physical sampling devices may be designed that directly 
record discrete low-rate incoherent measurements of the analog signal. This 
should be especially useful in situations where large collections of samples may 
be costly, difficult or impossible to obtain. 

Data compression. The sparse basis in which the signal is to be represented may 
be unknown or unavailable. However, a randomly designed O is suitable for 
almost all signals. We stress that these protocols are nonadaptive to the signal 
and simply require to correlate it with a small number of other fixed vectors. 

Inverse problems. The measurement system may have to satisfy rigid constraints 
such as in MR angiography and other MR setups, where O records a subset of 
the Fourier transform. However, if a sparse basis exists that is also incoherent 
with <I>, then efficient sensing is possible. 

A particularly interesting example of successful implementation of compressive sens- 
ing methodology is provided by a digital camera newly developed by Richard Bara- 
niuk and Kevin Kelly at Rice University (see dsp.rice.edu/ cs/ cscamera) |107l ll2l| . 

In the detector array of a conventional digital camera, each pixel performs an 
analog-to-digital conversion; for example, the detector on a 5-megapixel camera pro- 
duces 5 million bits for each image. This large amount of data is then dramatically 
reduced through a compression algorithms (using wavelet or other techniques) so as 
not to overburden typical storage and transfer capacities. 

Rather than collect 5 million pixels for an image, the new camera samples only a 
factor of about four times the 50,000 pixels that the jpg compression might typically 
output. These 200,000 single-pixel measurements provide an immediate 25-fold sav- 
ings in data collected compared with 5 megapixels. 

The camera developed at Rice replaces the CCD array with a digital-micromirror 
device (DMD). A sequence of random projections is performed on the micromirror ar- 
ray, so that the image "bounces off" of each random pattern in the sequence, and the 
reflected light from each pattern is collected sequentially with a photodiode sensor 
that acts as the single-pixel detector (see Figure 3). After taking a sequence of essen- 
tially time-multiplexed measurements, a specific £i minimization algorithms decodes 
the picture out of the collected sequence of single-pixel measurements. 
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